Average sentence length (characters) |
---|
102.1173 |
Standard deviation |
---|
53.12418334447676 |
Sentence length (characters) | Percentage of sentences |
---|---|
10 ... 19 | 3.7150 |
20 ... 29 | 12.6870 |
30 ... 39 | 15.7160 |
40 ... 49 | 15.6420 |
50 ... 59 | 13.7100 |
60 ... 69 | 10.9900 |
70 ... 79 | 8.4320 |
80 ... 89 | 6.3640 |
90 ... 99 | 4.7030 |
100 ... 109 | 3.4170 |
110 ... 119 | 2.4280 |
120 ... 129 | 1.7640 |
130 ... 139 | 0.4320 |
The length of a sentence can be measured in the number of characters or the number of words. These two possibilities are addressed in this subsection and the next. We find the average length and the percentage of sentences for length ranges of five characters.
The plot shows the corresponding length distribution.
Clearly, average sentence length depends on the text genre. For a fixed genre, however, average sentence length is one of the classical language parameters.
If the sentence length is limited to 255 characters, the distribution ends abruptly on the right. Without this limit, it would continue smoothly.
Average:
select avg(char_length(sentence)) from sentences;
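The query above gives only the average; the source does not show how the standard deviation was obtained. As a minimal sketch, assuming the population standard deviation is meant and that the sentences are available as a list of Python strings, both values could be computed as follows. The function name `length_stats` and the sample sentences are illustrative and not part of the original.

```python
import math


def length_stats(sentences):
    """Return (mean, population standard deviation) of the sentence lengths in characters."""
    lengths = [len(s) for s in sentences]
    if not lengths:
        raise ValueError("need at least one sentence")
    mean = sum(lengths) / len(lengths)
    # Population variance: divide by N, not N - 1 (an assumption, see above).
    variance = sum((n - mean) ** 2 for n in lengths) / len(lengths)
    return mean, math.sqrt(variance)


if __name__ == "__main__":
    # Hypothetical sample data, only for illustration.
    sample = ["A short one.", "This sentence is a little longer.", "Tiny."]
    mean, std = length_stats(sample)
    print(f"average: {mean:.4f}, standard deviation: {std:.4f}")
```

If the sample standard deviation were intended instead, the divisor in the variance would be N - 1.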
Table data:
SELECT @all := count(*) FROM sentences;
SELECT concat(5*round(char_length(sentence)/5)-2, " ... ", 5*round(char_length(sentence)/5)+2) AS le, 100*count(*)/@all AS percentage FROM sentences GROUP BY le ORDER BY round(char_length(sentence)/5);
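The binning in the query above rounds each length to the nearest multiple of five and labels the range from two below to two above that value. The following sketch mirrors that logic in Python for a list of strings; it is an illustration under that assumption, not the code used to produce the table. The names `length_histogram` and `BIN_WIDTH` are hypothetical.

```python
from collections import Counter

BIN_WIDTH = 5  # same range width as in the SQL query
HALF = BIN_WIDTH // 2


def length_histogram(sentences):
    """Return a list of (range label, percentage) pairs, ordered by length range."""
    # (n + 2) // 5 * 5 is the multiple of five nearest to n,
    # which matches 5*round(n/5) for integer n.
    centers = Counter((len(s) + HALF) // BIN_WIDTH * BIN_WIDTH for s in sentences)
    total = sum(centers.values())
    return [
        (f"{center - HALF} ... {center + HALF}", 100 * count / total)
        for center, count in sorted(centers.items())
    ]


if __name__ == "__main__":
    # Hypothetical sample: sentences of 8, 12, 13 and 13 characters.
    sample = ["x" * 8, "x" * 12, "x" * 13, "x" * 13]
    for label, percentage in length_histogram(sample):
        print(f"{label} | {percentage:.4f}")
```

Integer arithmetic is used instead of Python's `round` so that the bin boundaries do not depend on floating-point rounding behaviour.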
Compared to the next subsection, the plot is very serrated. What is the reason?
4.2.2 Length of sentences in words